A Text Retrieval System Based on Distributed Representations
نویسندگان
چکیده
Most text retrieval systems are essentially based on bagof-words (BOW) text representations. Despite popularity of BOW, it ignores the internal semantic meanings of words since each word is treated as an atomic unit. Recently, distributed word and text representations become increasingly popular in NLP literatures. They embed syntactic and semantic information of words and texts into low-dimensional vectors, thus overcome the weaknesses of traditional BOW representations to some extent. In this paper, we implement a text retrieval system that are totally supported by distributed representations. Our new system no longer relies on the matchings of words in queries and texts, but uses semantic similarity to judge if a text is relevant to a query and to what extent, which provides better user experience compared with traditional text retrieval systems.
منابع مشابه
Image retrieval using the combination of text-based and content-based algorithms
Image retrieval is an important research field which has received great attention in the last decades. In this paper, we present an approach for the image retrieval based on the combination of text-based and content-based features. For text-based features, keywords and for content-based features, color and texture features have been used. Query in this system contains some keywords and an input...
متن کاملSemiautomatic Image Retrieval Using the High Level Semantic Labels
Content-based image retrieval and text-based image retrieval are two fundamental approaches in the field of image retrieval. The challenges related to each of these approaches, guide the researchers to use combining approaches and semi-automatic retrieval using the user interaction in the retrieval cycle. Hence, in this paper, an image retrieval system is introduced that provided two kind of qu...
متن کاملLearning to Match using Local and Distributed Representations of Text for Web Search
Models such as latent semantic analysis and those based on neural embeddings learn distributed representations of text, and match the query against the document in the latent semantic space. In traditional information retrieval models, on the other hand, terms have discrete or local representations, and the relevance of a document is determined by the exact matches of query terms in the body te...
متن کاملScientific Article Recommendation by using Distributed Representations of Text and Graph
Scientific article recommendation problem deals with recommending similar scientific articles given a query article. It can be categorized as a content based similarity system. Recent advancements in representation learning methods have proven to be effective in modeling distributed representations in different modalities like images, languages, speech, networks etc. The distributed representat...
متن کاملAggregation-Based Structured Text Retrieval
DEFINITION Text retrieval is concerned with the retrieval of documents in response to user queries. This is achieved by (i) representing documents and queries with indexing features that provide a characterisation of their information content, and (ii) defining a function that uses these representations to perform retrieval. Structured text retrieval introduces a finer-grained retrieval paradig...
متن کامل